# High-precision Quantization
Cognitivecomputations Qwen3 72B Embiggened GGUF
Apache-2.0
A quantized version based on the cognitivecomputations/Qwen3-72B-Embiggened model, quantized using llama.cpp, and can run efficiently in various environments.
Large Language Model
C
bartowski
826
1
Allura Org Q3 30B A3B Designant GGUF
A Llamacpp imatrix quantized version based on allura-org/Q3-30B-A3B-Designant, suitable for various quantization needs, supporting role-playing and conversational tasks.
Large Language Model
A
bartowski
344
1
Pocketdoc Dans PersonalityEngine V1.3.0 12b GGUF
Apache-2.0
A 12B-parameter multilingual large language model based on llama.cpp quantization, supporting role-play, story creation, and multi-domain professional tasks
Large Language Model
P
bartowski
1,027
3
Allura Org Q3 30b A3b Pentiment GGUF
Q3-30b-A3b-Pentiment is a large language model based on the LLaMA architecture, optimized through quantization for various text generation tasks.
Large Language Model
A
bartowski
1,220
2
Primeintellect INTELLECT 2 GGUF
Apache-2.0
Quantized version of INTELLECT-2, optimized using llama.cpp, supporting multiple quantization types to accommodate different hardware requirements.
Large Language Model
P
bartowski
6,268
4
Cognitivecomputations Dolphin Mistral 24B Venice Edition GGUF
Apache-2.0
Llamacpp imatrix quantized version of Dolphin-Mistral-24B-Venice-Edition, supporting multiple quantization types, suitable for text generation tasks.
Large Language Model
C
bartowski
4,718
6
Goekdeniz Guelmez Josiefied Qwen3 8B Abliterated V1 GGUF
This is a quantized version of the Qwen3-8B model, using llama.cpp for iMatrix quantization, suitable for chat scenarios.
Large Language Model
G
bartowski
7,520
12
Allura Org Remnant Glm4 32b GGUF
Apache-2.0
Remnant-GLM4-32B is a 32B-parameter large language model based on the GLM4 architecture, supporting role-playing and conversational interactions, particularly suitable for salamander-related applications.
Large Language Model
A
bartowski
2,198
2
Qwen Qwen3 30B A3B GGUF
Apache-2.0
Quantized version based on Qwen/Qwen3-30B-A3B, using llama.cpp for multi-precision quantization, suitable for text generation tasks.
Large Language Model
Q
bartowski
79.34k
49
Glm 4 9b Chat Abliterated GGUF
Other
A 9B-parameter chat model based on GLM-4 architecture, supporting Chinese and English dialogues, quantized for various hardware environments
Large Language Model Supports Multiple Languages
G
bartowski
2,676
11
Beaverai MN 2407 DSK QwQify V0.1 12B GGUF
Apache-2.0
A large language model based on 12B parameters, supporting text generation tasks, released under the Apache-2.0 license.
Large Language Model
B
bartowski
1,547
5
Thedrummer Cydonia 24B V2 GGUF
Other
This is a 24B-parameter large language model, processed with llama.cpp's imatrix quantization, offering multiple quantized versions to suit different hardware requirements.
Large Language Model
T
bartowski
5,797
16
Nousresearch DeepHermes 3 Llama 3 8B Preview GGUF
A dialogue model fine-tuned based on Llama-3-8B, supporting multiple quantization versions, suitable for tasks such as chatting, reasoning, and role-playing.
Large Language Model English
N
bartowski
1,038
16
Nvidia AceInstruct 7B GGUF
A quantized version based on NVIDIA's AceInstruct-7B model, processed using llama.cpp with support for multiple quantization types, suitable for tasks in code, mathematics, and general domains.
Large Language Model
N
bartowski
196
3
Cognitivecomputations Dolphin3.0 R1 Mistral 24B GGUF
Dolphin3.0-R1-Mistral-24B is a 24B-parameter large language model based on the Mistral architecture, trained by Eric Hartford, focusing on reasoning and first-principles analysis.
Large Language Model English
C
bartowski
10.24k
72
Huihui Ai DeepSeek R1 Distill Llama 70B Abliterated GGUF
GGUF quantized version of DeepSeek-R1-Distill-Llama-70B-abliterated, suitable for local inference, offering multiple quantization options to meet different hardware requirements.
Large Language Model
H
bartowski
7,848
25
Deepseek R1 Distill Qwen 32B Abliterated GGUF
DeepSeek-R1-Distill-Qwen-32B-abliterated is a distilled version based on Qwen-32B, offering multiple quantization options to accommodate different hardware requirements.
Large Language Model
D
bartowski
20.10k
103
Featured Recommended AI Models